ASYMPTOTIC FORMULA FOR THE MOMENTS OF MINKOWSKI 
QUESTION MARK FUNCTION IN THE INTERVAL [0,1]. 
Abstract. In this paper we prove an asymptotic formula for the moments of the Minkowski
question mark function, which describes the distribution of rationals in the Farey tree.
The main idea is to demonstrate that a certain variation of the Laplace method is applicable
to this problem; the task then reduces to a number of technical calculations.
1. Introduction and main result

The Minkowski question mark function $?(x)$ is defined for real numbers in the interval
$[0,1]$ via the expression

\[ ?([a_0, a_1, a_2, a_3, \dots]) = 2\left(1 - 2^{-a_0} + 2^{-(a_0+a_1)} - 2^{-(a_0+a_1+a_2)} + \dots\right), \]

where $x = [a_0, a_1, a_2, a_3, \dots]$ stands for the representation of $x$ by a (regular) continued
fraction [7] (note that for $x \in [0,1)$ we have $a_0 = 0$). This function was proved to be
continuous, monotonic and singular [5]. For convenience we introduce the Minkowski
question mark function $F(x)$ on the whole non-negative real line by

\[ F([a_0, a_1, a_2, a_3, \dots]) = 1 - 2^{-a_0} + 2^{-(a_0+a_1)} - 2^{-(a_0+a_1+a_2)} + \dots. \]

Thus, $2F(x) = ?(x)$ for $x \in [0,1]$. The function $F(x)$, as a distribution function, is uniquely
determined by the functional equation [1]

\[ 2F(x) = \begin{cases} F(x-1) + 1 & \text{if } x \ge 1, \\ F\left(\dfrac{x}{1-x}\right) & \text{if } 0 \le x < 1. \end{cases} \tag{1} \]
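As a quick sanity check, the series defining $F(x)$ and the functional equation (1) can be evaluated exactly for rational arguments. The sketch below is illustrative only; the helper names `continued_fraction` and `F` are our own choices and are not taken from the paper.

```python
from fractions import Fraction

def continued_fraction(x):
    """Regular continued fraction [a0, a1, ..., ar] of a rational x >= 0."""
    x = Fraction(x)
    terms = []
    while True:
        a = x.numerator // x.denominator  # integer part of x
        terms.append(a)
        x -= a
        if x == 0:
            return terms
        x = 1 / x

def F(x):
    """F(x) = 1 - 2^-a0 + 2^-(a0+a1) - ... for a rational x >= 0."""
    total, sign, partial_sum = Fraction(1), -1, 0
    for a in continued_fraction(x):
        partial_sum += a
        total += sign * Fraction(1, 2 ** partial_sum)
        sign = -sign
    return total

# Check the functional equation (1) and the symmetry F(x) + F(1/x) = 1.
for x in (Fraction(1, 3), Fraction(2, 3), Fraction(5, 7),
          Fraction(3, 2), Fraction(17, 5)):
    if x >= 1:
        assert 2 * F(x) == F(x - 1) + 1
    else:
        assert 2 * F(x) == F(x / (1 - x))
    assert F(x) + F(1 / x) == 1
```

Exact rational arithmetic is used so that the identities hold with equality rather than up to rounding.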

Equation (1) implies $F(x) + F(1/x) = 1$. The number-theoretic significance of the function $F(x)$
is apparent from the fact that it is exactly the limit probabilistic distribution of rationals
in the $n$th generation of the Calkin-Wilf tree [1] (which is just a permutation of the
Stern-Brocot tree; the Farey tree is a subtree of both). This tree is defined as follows. Starting
from the root $1$, each fraction generates two offspring:

\[ \frac{a}{b} \;\mapsto\; \frac{a}{a+b}, \quad \frac{a+b}{b}. \]

FIGURE 1. Minkowski's question mark $?(x)$, $x \in [0,1]$

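Elementary considerations show that this tree contains every positive rational number once
and only once, each in lowest terms [1]. The first four generations are

\[ \tfrac{1}{1}; \qquad \tfrac{1}{2},\ \tfrac{2}{1}; \qquad \tfrac{1}{3},\ \tfrac{3}{2},\ \tfrac{2}{3},\ \tfrac{3}{1}; \qquad \tfrac{1}{4},\ \tfrac{4}{3},\ \tfrac{3}{5},\ \tfrac{5}{2},\ \tfrac{2}{5},\ \tfrac{5}{3},\ \tfrac{3}{4},\ \tfrac{4}{1}. \]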




It is important to note that the $n$th generation of the Calkin-Wilf binary tree consists
of exactly those rational numbers whose continued fraction elements sum to $n$.
This fact follows directly from the definition. First, if a rational number
$\frac{a}{b}$ is represented as a continued fraction $[a_0, a_1, \dots, a_r]$, then the map
$\frac{a}{b} \mapsto \frac{a+b}{b}$ maps $\frac{a}{b}$ to $[a_0+1, a_1, \dots, a_r]$. Second, the map
$\frac{a}{b} \mapsto \frac{a}{a+b}$ maps $\frac{a}{b}$ to $[0, a_1+1, a_2, \dots, a_r]$ in case $\frac{a}{b} < 1$, and
to $[0, 1, a_0, a_1, \dots, a_r]$ in case $\frac{a}{b} > 1$. In every case the sum of the elements increases
by exactly $1$. This is an important fact which makes the investigation
of rational numbers according to their position in the Calkin-Wilf tree highly motivated
from the perspective of metric number theory and the dynamics of continued fractions. We
refer the reader to the papers [1], [2] and [3] for a wider overview of the properties of
the Minkowski question mark function, and to the web page [10] for an exhaustive bibliography.
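The relation between generations of the Calkin-Wilf tree and sums of continued fraction elements can be checked mechanically. The following is a minimal sketch; the function names are hypothetical helpers introduced here for illustration.

```python
from fractions import Fraction

def continued_fraction(x):
    """Regular continued fraction [a0, a1, ..., ar] of a rational x >= 0."""
    x = Fraction(x)
    terms = []
    while True:
        a = x.numerator // x.denominator
        terms.append(a)
        x -= a
        if x == 0:
            return terms
        x = 1 / x

def calkin_wilf_generation(n):
    """List of the 2^(n-1) fractions in the n-th generation (root = generation 1)."""
    level = [Fraction(1)]
    for _ in range(n - 1):
        level = [child
                 for q in level
                 for child in (Fraction(q.numerator, q.numerator + q.denominator),
                               Fraction(q.numerator + q.denominator, q.denominator))]
    return level

# Every member of generation n has continued fraction elements summing to n.
for n in range(1, 9):
    generation = calkin_wilf_generation(n)
    assert len(generation) == 2 ** (n - 1)
    assert all(sum(continued_fraction(q)) == n for q in generation)
```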
The work [1] by the author initiated the study of the moments of the Minkowski question
mark function. Surprisingly, the only work where an analogous sequence of moments was
considered before is [6], where the authors consider the moments of a related singular
function $F_2(x)$. The moments of the question mark function are defined by

\[ m_L = \int_0^\infty \left(\frac{x}{x+1}\right)^L dF(x) = 2\int_0^1 x^L\, dF(x) = \int_0^1 x^L\, d?(x). \tag{2} \]



This sequence is of definite number-theoretical significance because

\[ m_L = \lim_{n\to\infty} 2^{2-n} \sum_{a_1+a_2+\dots+a_s=n} [0, a_1, a_2, \dots, a_s]^L \tag{3} \]

(the summation takes place over rational numbers presented as continued fractions; thus,
$s \ge 1$, $a_j \ge 1$ and $a_s \ge 2$). As an example of the structural properties of the sequence $m_L$
we mention the following result. Let $\mathfrak{m}(t) = \sum_{L=0}^\infty m_L \frac{t^L}{L!}$, which is an entire function. One of
the main results about $\mathfrak{m}(t)$ is that it is uniquely determined by the regularity condition
$\mathfrak{m}(-t) \ll e^{-\sqrt{t\log 2}}$ as $t \to \infty$, the boundary condition $\mathfrak{m}(0) = 1$, and the integral equation

\[ \mathfrak{m}(-s) = (2e^s - 1)\int_0^\infty \mathfrak{m}'(-t)\, J_0(2\sqrt{st})\, dt, \quad s \in \mathbb{R}_+. \]

(Here $J_0(\ast)$ stands for the Bessel function $J_0(z) = \frac{1}{\pi}\int_0^\pi \cos(z\sin x)\, dx$.) This reveals
the surprising fact that $\mathfrak{m}(t)$ can be considered as the dyadic analogue of the entire functions
associated with Maass wave forms [9]. Also, the generating function $G(z) = \sum_{L=1}^\infty m_L z^{L-1}$
can be analytically continued to the cut plane $\mathbb{C} \setminus (1, \infty)$, and it satisfies the three term
functional equation

\[ \frac{1}{z} + \frac{1}{z^2}\, G\left(\frac{1}{z}\right) + 2G(z+1) = G(z). \]

In view of the specific form of this equation, it is natural to call $G(z)$ the dyadic period
function.
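Formula (3) lends itself to a direct numerical experiment, since the rationals $[0, a_1, \dots, a_s]$ with $a_1 + \dots + a_s = n$ are exactly the members of the $n$th Calkin-Wilf generation lying in $(0,1)$. The sketch below evaluates the finite-$n$ sum only (no limit is taken), and the function names are hypothetical; for comparison, the values $m_1 = 0.5$ and $m_2 = 0.2909264764$ are those quoted in the table at the end of the paper.

```python
def generation_below_one(n):
    """Members a/b < 1 of the n-th Calkin-Wilf generation, as integer pairs (a, b)."""
    level = [(1, 1)]
    for _ in range(n - 1):
        level = [child
                 for (a, b) in level
                 for child in ((a, a + b), (a + b, b))]
    return [(a, b) for (a, b) in level if a < b]

def moment_approx(L, n):
    """Finite-n value of the right-hand side of (3), for n >= 2."""
    members = generation_below_one(n)
    return 2.0 ** (2 - n) * sum((a / b) ** L for a, b in members)

for n in (6, 10, 14):
    print(n, moment_approx(1, n), moment_approx(2, n))
```

The first moment equals $1/2$ already at finite $n$ because the set of fractions in question is symmetric under $x \mapsto 1 - x$; higher moments only approach their limits as $n$ grows.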

It is not clear whether there exists a closed form formula for the moments $m_L$. This
would be greatly desirable in view of the expression (3). On the other hand, an
asymptotic estimate is known [2]. Let $C = e^{-2\sqrt{\log 2}} = 0.189169995269+$. Then [2] provides
upper and lower bounds for $m_L$ in terms of $C^{\sqrt{L}}$ as $L \to \infty$, with
both implied constants absolute. The aim of this paper is to find the first exact
asymptotic term. Though from our point of view this is of lesser significance than a
closed form formula (if the latter exists), it is nevertheless of definite interest.

Theorem 1. Let $m_L$ be defined by (2). Then the following holds as $L \to \infty$:

\[ m_L = \sqrt[4]{4\pi^2\log 2}\cdot c_0 \cdot L^{1/4} C^{\sqrt{L}} + O\left(L^{-1/4} C^{\sqrt{L}}\right). \]

Here the constant $c_0$ is given by $c_0 = \int_0^1 \Psi(x)\, dx = 1.030199563382+$, where $\Psi(x)$ is the
$1$-periodic function given by $\Psi(x) = 2^x(1 - F(x))$ for $x \in [0, \infty)$.
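The constants appearing in Theorem 1 can be approximated numerically. The sketch below evaluates $F$ through a floating-point continued fraction expansion and estimates $c_0$ by a plain midpoint rule; both choices are ours, made for illustration, and are not the method by which the quoted digits were obtained.

```python
import math

def F(x, max_terms=80):
    """Floating-point F(x) = 1 - 2^-a0 + 2^-(a0+a1) - ... for real x >= 0."""
    total, sign, partial_sum = 1.0, -1.0, 0
    for _ in range(max_terms):
        a = math.floor(x)
        partial_sum += a
        if partial_sum > 64:   # remaining terms are below double precision
            break
        total += sign * 2.0 ** (-partial_sum)
        sign = -sign
        frac = x - a
        if frac < 1e-18:       # expansion terminates (x numerically an integer)
            break
        x = 1.0 / frac
    return total

def psi(x):
    """Psi(x) = 2^x (1 - F(x)), the 1-periodic function of Theorem 1."""
    return 2.0 ** x * (1.0 - F(x))

def c0_estimate(n=20000):
    """Midpoint-rule approximation of the integral of Psi over [0, 1]."""
    return sum(psi((i + 0.5) / n) for i in range(n)) / n

C = math.exp(-2.0 * math.sqrt(math.log(2.0)))
leading = (4.0 * math.pi ** 2 * math.log(2.0)) ** 0.25
print(c0_estimate(), C, leading * 1.030199563382)
```

Since $\Psi$ is only of bounded variation (it inherits the singular nature of $F$), the midpoint rule converges slowly here, so only the first few digits of the estimate should be trusted.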

Remark 1. The periodicity of $\Psi(x)$ is obvious from (1). Numerically, $\sqrt[4]{4\pi^2\log 2}\cdot c_0 =
2.356229889908+$. The sequence of moments

\[ M_L = \int_0^\infty x^L\, dF(x) \]

is equally important. These moments represent values of higher left derivatives of $G(z)$
at $z = 1$; more precisely, a formal Taylor expansion at $z = 1$ is given by

\[ G(z+1) = \sum_{L=1}^\infty M_L z^{L-1}. \]

As was proved in [1], one has the asymptotic formula $M_L \sim c_0 (\log 2)^{-L} L!$. Moreover, in this
case there exists an exact convergent asymptotic series [2]. Though the sequences $M_L$ and
$m_L$ are linearly dependent via the relations

\[ m_L = M_L - \sum_{s=0}^{L-1} M_s \binom{L}{s}, \quad L \ge 0, \]

it is nevertheless significant that the same structural constant $c_0$ manifests itself in the
asymptotic formulae for both of these sequences.
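The linear relation between $m_L$ and $M_L$ can be inverted recursively, which gives a cheap way to watch the constant $c_0$ emerge from the sequence $M_L$. In the sketch below the values of $m_L$ are copied from the table at the end of the paper, $m_0 = 1$ follows from (2), and the function name `big_moments` is a hypothetical helper.

```python
import math

# m_0 = 1 and m_1, ..., m_12 as quoted in the table at the end of the paper.
m = [1.0, 0.5, 0.2909264764, 0.1863897146, 0.1269922584, 0.09016445494,
     0.06592816257, 0.04929431046, 0.03751871185, 0.02897962203,
     0.02266585817, 0.01792085923, 0.01430468951]

def big_moments(small_moments):
    """Solve m_L = M_L - sum_{s<L} binom(L, s) M_s for M_0, M_1, ..."""
    M = []
    for L, m_L in enumerate(small_moments):
        M.append(m_L + sum(math.comb(L, s) * M[s] for s in range(L)))
    return M

M = big_moments(m)
for L in range(1, len(M)):
    # By the asymptotic formula for M_L, this ratio should approach c_0.
    ratio = M[L] * math.log(2.0) ** L / math.factorial(L)
    print(L, M[L], ratio)
```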

Remark 2. As is clear from the proof, further terms of the asymptotic expansion can
be calculated as well. We confine ourselves to the first term, since the calculations are standard
(though tedious) and introduce no new ideas. Note that the structural
constants $c_s = \int_0^1 x^s \Psi(x)\, dx$ do appear in the asymptotic expansion, and the method used
shares some similarities with Euler-Maclaurin summation.

Remark 3. If we start from the representation of $m_L$ via the last integral of (2) (which is
the most natural), then to obtain the representation by the first integral we use the second
equation in (1). Moreover, one of the core stages of the proof relies on the periodicity
of $\Psi(x)$ and so uses the first equation in (1). Since these two equations are characteristic
only of the Minkowski question mark function, our asymptotic
formula for the moments of $F(x)$ is, generally speaking, unique among similar results for other
probabilistic distributions with proper support on the interval $[0, 1]$.
2. Proof

2.1. Preliminary calculations. In this paper we use Landau-type notation and denote
by $B$ some absolutely bounded function in a certain neighborhood of a variable; in our case
$B$ depends mostly on $L$ and we consider the case $L \to \infty$. As a convention, $B$ stands
for a different function each time it occurs, whether in another formula or in the same one.
Occasionally, this notation is used to denote dependence on other variables, and in each case
it should be clear which variable and neighborhood are implied. The main tool of
the proof is the Laplace method, which is used to find asymptotic expansions
of certain integrals depending on a parameter. Though in our case certain corrections,
amendments and variations are necessary, the main technique is standard and can be
found, for example, in [8]. Since $1 - F(x) = 2^{-x}\Psi(x)$, identity (2) implies

\[ m_L = \int_0^\infty \left(\frac{x}{x+1}\right)^L d(F(x) - 1) = (F(x) - 1)\left(\frac{x}{x+1}\right)^L \Bigg|_0^\infty + L\int_0^\infty 2^{-x}\, \frac{x^{L-1}}{(x+1)^{L+1}}\, \Psi(x)\, dx = L\int_0^\infty 2^{-x}\, \frac{x^{L-1}}{(x+1)^{L+1}}\, \Psi(x)\, dx. \]

Put $c = \log 2$. The function $f(x) = L\log\frac{x}{x+1} - cx$ achieves its maximum at $x = x_0$,
where

\[ x_0(x_0 + 1) = \frac{L}{c}; \quad\text{thus, } x_0 = \frac{-1 + \sqrt{1 + \frac{4L}{c}}}{2}, \quad\text{consequently } x_0 = \sqrt{\frac{L}{c}} - \frac{1}{2} + \frac{B}{\sqrt{L}}. \]
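The location of the maximum of $f$ and its approximation by $\sqrt{L/c} - \frac{1}{2}$ are easy to confirm numerically. The following is a small sketch under the definitions above; the function names `f` and `saddle_point` are ours.

```python
import math

c = math.log(2.0)

def f(x, L):
    """f(x) = L log(x / (x + 1)) - c x, the exponent in the Laplace method."""
    return L * math.log(x / (x + 1.0)) - c * x

def saddle_point(L):
    """Positive root of x (x + 1) = L / c, i.e. the maximiser of f."""
    return (-1.0 + math.sqrt(1.0 + 4.0 * L / c)) / 2.0

for L in (10 ** 2, 10 ** 4, 10 ** 6):
    x0 = saddle_point(L)
    approx = math.sqrt(L / c) - 0.5
    # The last column should stay bounded, matching the B / sqrt(L) remainder.
    print(L, x0, approx, (x0 - approx) * math.sqrt(L))
```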

Let $c_0 = \int_0^1 \Psi(x)\, dx$, and let us rewrite the expression for $m_L$ as

\[ m_L = L c_0 \int_0^\infty \frac{1}{x(x+1)}\, e^{f(x)}\, dx + L\int_0^\infty (\Psi(x) - c_0)\, \frac{1}{x(x+1)}\, e^{f(x)}\, dx = c_0 L g_L + L r_L. \]

Our specific choice of extracting $c_0$ out of $\Psi(x)$ as the dominant ingredient can be motivated
as follows. Since $0.9 < \Psi(x) < 1.2$, the main weight of the integral defining
$m_L$ (as far as $\Psi(x)$ is concerned) falls on a certain constant in the range $(0.9, 1.2)$.
Moreover, it is easy to verify that for any continuous $1$-periodic function $\Upsilon(x)$ one has

\[ \int_{-\infty}^\infty \Upsilon(x)\, e^{-Ax^2}\, dx = \frac{\sqrt{\pi}\, \Upsilon(0)}{\sqrt{A}} + O(A^{-1}) \quad\text{as } A \to \infty \text{ (if $\Upsilon(x)$ is smooth)}, \]

\[ \int_{-\infty}^\infty \Upsilon(x)\, e^{-Ax^2}\, dx = \frac{\sqrt{\pi}\int_0^1 \Upsilon(x)\, dx}{\sqrt{A}} + O(1) \quad\text{as } A \to 0+. \]

This can be seen heuristically from the fact that in the first case the dominant weight of
the integral is supported only in a neighborhood of $0$, while in the second case this
interval has a length tending to infinity. As will be seen in the next subsection, we are in
the second case (a posteriori, the weight of $m_L$ is properly supported on the interval
$(\sqrt{L/c} - L^{2/7}, \sqrt{L/c} + L^{2/7})$), and hence the correct constant is $c_0$ rather than $\Psi(x_0)$.
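The two regimes for Gaussian averages of a periodic function can be illustrated with a concrete example. The test function below, $\Upsilon(x) = 1 + \frac{1}{2}\cos(2\pi x)$, is our own choice (mean $1$, value $1.5$ at $0$); for it the integral has the closed form $\sqrt{\pi/A}\,(1 + \frac{1}{2}e^{-\pi^2/A})$, which the midpoint rule reproduces.

```python
import math

def upsilon(x):
    """Example 1-periodic function with mean 1 and value 1.5 at x = 0 (our choice)."""
    return 1.0 + 0.5 * math.cos(2.0 * math.pi * x)

def gaussian_average(A, points_per_unit=200):
    """sqrt(A / pi) times the integral of upsilon(x) exp(-A x^2), midpoint rule."""
    width = 1.0 / math.sqrt(A)
    R = 10.0 * width                       # the Gaussian is negligible beyond R
    h = min(1.0, width) / points_per_unit  # resolve both the period and the width
    n = int(2.0 * R / h)
    total = 0.0
    for i in range(n):
        x = -R + (i + 0.5) * h
        total += upsilon(x) * math.exp(-A * x * x)
    return math.sqrt(A / math.pi) * total * h

for A in (0.01, 1.0, 1000.0):
    print(A, gaussian_average(A), 1.0 + 0.5 * math.exp(-math.pi ** 2 / A))
```

For small $A$ the normalised integral is close to the mean of $\Upsilon$, and for large $A$ it is close to $\Upsilon(0)$, in line with the two formulas above.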

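2.2. Evaluation of $g_L$. Choose $\delta = \delta(L)$ such that $\delta / L^{1/4} \to \infty$ and $\delta / L^{1/3} \to 0$. For example,
henceforth we fix $\delta = \delta(L) = L^{2/7}$. Separate the integral defining $g_L$ into four parts:

\[ g_L = \left( \int_0^1 + \int_1^{x_0-\delta} + \int_{x_0-\delta}^{x_0+\delta} + \int_{x_0+\delta}^\infty \right) \frac{1}{x(x+1)}\, e^{f(x)}\, dx = \mathcal{I}_1 + \mathcal{I}_2 + \mathcal{I}_3 + \mathcal{I}_4. \tag{4} \]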

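2.2.1. Evaluation of $\mathcal{I}_3$. First, $f'(x_0) = 0$ and $f''(x_0) < 0$. Suppose $|x - x_0| < \delta$. Then
the Taylor formula gives that for a certain $\theta_x$ between $x_0$ and $x$ one has

\[ f(x) = f(x_0) - \alpha(x - x_0)^2 + \beta(x - x_0)^3 + \frac{f^{(4)}(\theta_x)}{24}(x - x_0)^4. \]

Direct calculations show that

\[ f''(x) = -\frac{L(2x+1)}{x^2(x+1)^2}; \quad\text{thus, } f''(x_0) = -\frac{c^2}{L}(2x_0 + 1); \quad\text{this implies } \alpha = \frac{c^{3/2}}{\sqrt{L}} + \frac{B}{L^{3/2}}; \]

\[ \beta = \frac{f'''(x_0)}{6} = \frac{L(3x_0^2 + 3x_0 + 1)}{3x_0^3(x_0+1)^3} = \frac{c^2}{L} + \frac{B}{L^{3/2}}; \]

\[ f^{(4)}(x) = 6L\left(\frac{1}{(x+1)^4} - \frac{1}{x^4}\right); \quad\text{this gives } f^{(4)}(\theta_x) = \frac{B}{L^{3/2}}. \]

In the same fashion,

\[ \frac{1}{x(x+1)} = \frac{1}{x_0(x_0+1)} + \gamma(x - x_0) + \sigma_x (x - x_0)^2, \]

where

\[ \gamma = -\frac{2x_0 + 1}{x_0^2(x_0+1)^2} = -\frac{2c^{3/2}}{L^{3/2}} + \frac{B}{L^{5/2}}, \quad \sigma_x = \frac{B}{L^2}. \]

Now let us evaluate $e^{f(x_0)}$. For $x \to \infty$, we have

\[ \log\frac{x}{x+1} = -\frac{1}{x} + \frac{1}{2x^2} + \frac{B}{x^3}; \quad\text{this gives} \]

\[ L\log\frac{x_0}{x_0+1} = -\frac{L}{x_0} + \frac{L}{2x_0^2} + \frac{B}{\sqrt{L}} = -\sqrt{cL} - \frac{c}{2} + \frac{c}{2} + \frac{B}{\sqrt{L}} = -\sqrt{cL} + \frac{B}{\sqrt{L}}. \]

Further,

\[ -cx_0 = -\sqrt{cL} + \frac{c}{2} + \frac{B}{\sqrt{L}}; \quad\text{this gives } e^{f(x_0)} = \exp\left(-2\sqrt{cL} + \frac{c}{2} + \frac{B}{\sqrt{L}}\right) = \sqrt{2}\, C^{\sqrt{L}} \left(1 + \frac{B}{\sqrt{L}}\right). \]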
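We have now obtained all the components necessary to evaluate $\mathcal{I}_3$. Note that

\[ \beta(x - x_0)^3 = \frac{B}{L}\cdot L^{6/7} = o(1), \quad \frac{f^{(4)}(\theta_x)}{24}(x - x_0)^4 = o(1). \]

Since $e^y = 1 + y + By^2$ for $y = o(1)$, the function under the integral (4) in the range
$(x_0 - \delta, x_0 + \delta)$ can be written as

\begin{align*}
\frac{1}{x(x+1)}\, e^{f(x)} &= \left(\frac{1}{x_0(x_0+1)} + \gamma(x - x_0) + \sigma_x(x - x_0)^2\right) \\
&\quad \times \exp\left(f(x_0) - \alpha(x - x_0)^2 + \beta(x - x_0)^3 + \frac{f^{(4)}(\theta_x)}{24}(x - x_0)^4\right) \\
&= e^{f(x_0)} e^{-\alpha(x - x_0)^2} \left(1 + \beta(x - x_0)^3 + \frac{B}{L^{3/2}}(x - x_0)^4 + \frac{B}{L^2}(x - x_0)^6\right) \\
&\quad \times \left(\frac{1}{x_0(x_0+1)} + \gamma(x - x_0) + \sigma_x(x - x_0)^2\right).
\end{align*}

(The bound $|x - x_0| = BL^{1/2}$ was used in merging $\frac{B}{L^{5/2}}(x - x_0)^7$ and $\frac{B}{L^3}(x - x_0)^8$ into
$\frac{B}{L^2}(x - x_0)^6$.) Thus, we have decomposed the function under the integral (4) into the sum of
twelve functions. It is important to note that two of these functions,

\[ e^{f(x_0)} e^{-\alpha(x - x_0)^2}\, \gamma(x - x_0) \quad\text{and}\quad \frac{1}{x_0(x_0+1)}\, e^{f(x_0)} e^{-\alpha(x - x_0)^2}\, \beta(x - x_0)^3, \]

though being comparatively large, are odd functions in $(x - x_0)$ and consequently contribute $0$
after integration. It remains to evaluate all the remaining summands. This is a routine
job. For example,

\[ e^{f(x_0)}\, \gamma\beta \int_{x_0-\delta}^{x_0+\delta} e^{-\alpha(x - x_0)^2} (x - x_0)^4\, dx = e^{f(x_0)}\, \frac{\gamma\beta}{\alpha^{5/2}} \int_{-\sqrt{\alpha}\delta}^{\sqrt{\alpha}\delta} y^4 e^{-y^2}\, dy = BL^{-3/2 - 1 + 5/4}\, C^{\sqrt{L}} = BL^{-5/4} C^{\sqrt{L}}. \]

The second example:

\[ e^{f(x_0)}\, \gamma \int_{x_0-\delta}^{x_0+\delta} e^{-\alpha(x - x_0)^2}\, \frac{B}{L^{3/2}} (x - x_0)^5\, dx = e^{f(x_0)}\, \frac{B}{L^{3/2}\cdot L^{3/2}\alpha^3} \int_{-\sqrt{\alpha}\delta}^{\sqrt{\alpha}\delta} |y|^5 e^{-y^2}\, dy = BL^{-3/2 - 3/2 + 3/2}\, C^{\sqrt{L}} = BL^{-3/2} C^{\sqrt{L}}. \]

Another example: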
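\[ \frac{1}{x_0(x_0+1)}\, e^{f(x_0)} \int_{x_0-\delta}^{x_0+\delta} e^{-\alpha(x - x_0)^2}\, \frac{B}{L^2} (x - x_0)^6\, dx = e^{f(x_0)}\, \frac{B}{L\cdot L^2 \alpha^{7/2}} \int_{-\sqrt{\alpha}\delta}^{\sqrt{\alpha}\delta} y^6 e^{-y^2}\, dy = BL^{-1 - 2 + 7/4}\, C^{\sqrt{L}} = BL^{-5/4} C^{\sqrt{L}}. \]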
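Therefore, a direct inspection shows that all functions apart from the main term contribute
at most $BC^{\sqrt{L}} L^{-5/4}$ to the value of $g_L$. We are left with determining the magnitude of
the main term. Since $\sqrt{\alpha}\,\delta \sim c^{3/4} L^{1/28}$, this yields

\[ \int_{-\sqrt{\alpha}\delta}^{\sqrt{\alpha}\delta} e^{-y^2}\, dy = \int_{-\infty}^{\infty} e^{-y^2}\, dy + B\int_{\sqrt{\alpha}\delta}^{\infty} e^{-y^2}\, dy = \sqrt{\pi} + B\exp(-c^{3/2} L^{1/14}). \]

And so, the main term is given by

\begin{align*}
\frac{1}{x_0(x_0+1)}\, e^{f(x_0)} \int_{-\delta}^{\delta} e^{-\alpha x^2}\, dx &= \left(\sqrt{\pi} + B\exp(-c^{3/2} L^{1/14})\right) \frac{1}{\sqrt{\alpha}\, x_0(x_0+1)}\, e^{f(x_0)} \\
&= \sqrt{2\pi}\, e^{-2\sqrt{cL}}\, c^{1/4} L^{-3/4} \left(1 + \frac{B}{\sqrt{L}}\right).
\end{align*}

Thus, we obtain the main asymptotic term

\[ \mathcal{I}_3 = \sqrt[4]{4\pi^2\log 2}\cdot L^{-3/4} C^{\sqrt{L}} + BL^{-5/4} C^{\sqrt{L}}. \]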

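2.2.2. Evaluation of $\mathcal{I}_1$, $\mathcal{I}_2$ and $\mathcal{I}_4$. Trivially,

\[ \mathcal{I}_1 = \int_0^1 \frac{x^{L-1}}{(x+1)^{L+1}}\, 2^{-x}\, dx = B2^{-L}. \]

To evaluate $\mathcal{I}_2$, we use the standard inequality

\[ \log\left(\frac{x}{x+1}\right) < -\frac{1}{x} + \frac{1}{2x^2} \quad\text{for } x > 1. \tag{5} \]

Thus, since $f(x)$ is an increasing function in the interval $[1, x_0]$, we have

\[ \mathcal{I}_2 = \int_1^{x_0-\delta} \frac{1}{x(x+1)}\, e^{f(x)}\, dx < \log 2\cdot e^{f(x_0-\delta)} = Be^{f(x_0-\delta)}. \]

Further, using the inequality (5) we have

\begin{align*}
f(x_0 - \delta) &= L\log\left(\frac{x_0 - \delta}{x_0 - \delta + 1}\right) - c(x_0 - \delta) < -\frac{L}{x_0 - \delta} + \frac{L}{2(x_0 - \delta)^2} - cx_0 + c\delta \\
&= -\frac{L}{x_0} - cx_0 + B + c\delta + \left(\frac{L}{x_0} - \frac{L}{x_0 - \delta}\right) \\
&= -\frac{L}{x_0} - cx_0 + B - \frac{L\delta^2}{x_0^3} = -2\sqrt{cL} + B - c^{3/2} L^{1/14}
\end{align*}

(recall that $\delta = L^{2/7}$). Therefore,

\[ \mathcal{I}_2 = BC^{\sqrt{L}} \exp(-c^{3/2} L^{1/14}) = BL^{-5/4} C^{\sqrt{L}}. \]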
In the same vein,

\[ f(x_0 + \delta) = -2\sqrt{cL} + B - c^{3/2} L^{1/14}. \]

Since $f(x)$ is decreasing in the interval $[x_0, \infty)$, we obtain

\[ \mathcal{I}_4 = \int_{x_0+\delta}^\infty \frac{1}{x(x+1)}\, e^{f(x)}\, dx < e^{f(x_0+\delta)} \int_1^\infty \frac{dx}{x(x+1)} = BC^{\sqrt{L}} \exp(-c^{3/2} L^{1/14}) = BL^{-5/4} C^{\sqrt{L}}. \]

Combining all the estimates for $\mathcal{I}_1$, $\mathcal{I}_2$, $\mathcal{I}_3$ and $\mathcal{I}_4$, we eventually obtain

\[ g_L = \sqrt[4]{4\pi^2\log 2}\cdot L^{-3/4} C^{\sqrt{L}} + BL^{-5/4} C^{\sqrt{L}}. \]
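The asymptotic formula for $g_L$ can be compared with a direct numerical evaluation of the defining integral. The sketch below uses a plain midpoint rule with a finite upper cutoff; the step size and the cutoff are our assumptions, chosen so that the neglected tail is far below the displayed precision.

```python
import math

c = math.log(2.0)
C = math.exp(-2.0 * math.sqrt(c))

def g_numeric(L, step=0.01):
    """Midpoint-rule value of g_L = int_0^inf e^{f(x)} / (x (x + 1)) dx."""
    x0 = (-1.0 + math.sqrt(1.0 + 4.0 * L / c)) / 2.0
    upper = 20.0 * x0 + 50.0   # assumed cutoff; the integrand decays like 2^-x
    n = int(upper / step)
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * step
        fx = L * math.log(x / (x + 1.0)) - c * x
        total += math.exp(fx) / (x * (x + 1.0))
    return total * step

def g_leading(L):
    """Main term (4 pi^2 log 2)^(1/4) L^(-3/4) C^sqrt(L) of g_L."""
    return (4.0 * math.pi ** 2 * c) ** 0.25 * L ** -0.75 * C ** math.sqrt(L)

for L in (25, 100, 400):
    # The ratio should tend to 1, with a relative error of order L^(-1/2).
    print(L, g_numeric(L) / g_leading(L))
```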

2.3. Evaluation of $r_L$. In this subsection we can be more concise, since the method is
the same as the one we used to evaluate $g_L$. Recall that

\[ r_L = \int_0^\infty (\Psi(x) - c_0)\, \frac{1}{x(x+1)}\, e^{f(x)}\, dx. \]

Let

\[ \Phi(x) = \int_0^x (\Psi(t) - c_0)\, dt. \]

The definition of $c_0$ yields that $\Phi(x)$ is a bounded (and periodic) function. Using
integration by parts, we can rewrite the expression for $r_L$ as

\[ r_L = \int_0^\infty \Phi(x) h(x) e^{f(x)}\, dx; \quad\text{thus, } r_L = B\int_0^\infty h(x) e^{f(x)}\, dx, \]

where

\[ h(x) = -\frac{d}{dx}\, \frac{1}{x(x+1)} - \frac{1}{x(x+1)}\, f'(x) = \frac{2x+1}{x^2(x+1)^2} - \frac{L}{x^2(x+1)^2} + \frac{c}{x(x+1)}. \]

Note that

\[ h(x_0) = \frac{2x_0 + 1}{x_0^2(x_0+1)^2} = \frac{2c^{3/2}}{L^{3/2}} + \frac{B}{L^{5/2}}. \]

As a matter of fact, we have an integral completely analogous to the one defining $g_L$, with
the major difference in the estimate $h(x_0) = BL^{-3/2}$, whereas $\frac{1}{x_0(x_0+1)} = BL^{-1}$. Thus,
using the same method to evaluate $r_L$ as we did with $g_L$, one gets

\[ r_L = BL^{1/4 - 3/2}\, C^{\sqrt{L}} = BL^{-5/4} C^{\sqrt{L}}. \]

Since $m_L = c_0 L g_L + L r_L$, this finishes the proof of Theorem 1. ■
2.4. Final remarks. As can be seen from the proof, the share of the integrals $\mathcal{I}_1$, $\mathcal{I}_2$
and $\mathcal{I}_4$ in the exact value of $m_L$ is of order $\exp(-c^{3/2} L^{1/14})$. Thus, the standard
machinery of asymptotic expansions shows that all terms of the asymptotic series of $m_L$
are supported on the neighborhood $(x_0 - \delta, x_0 + \delta)$. A more thorough inspection reveals that
$m_L$ possesses a full asymptotic expansion with certain coefficients $A_i$.
As was mentioned, every term of this expansion can be obtained by (increasingly tedious)
computations. For example, $A_1$ is a sum of $c_0$ and $c_1 = \int_0^1 x\Psi(x)\, dx$, each multiplied by
some explicit constant (the constant $c_1$ occurs while extracting the main term of the
integral $r_L$). Nevertheless, from a number-theoretic point of view our main concern is the
structure of values rather than their magnitude (though dominant) in some small
neighborhood of $x_0$, and hence we have confined ourselves to exhibiting the possibility of such an
asymptotic expansion.

We finish by providing a table of some values of the constants $m_L$.



Sequence $m_L$

    L      $m_L$                   $m^*_L$
    1      0.5000000000            2.643125297
    2      0.2909264764            2.577573745
    3      0.1863897146            2.533204605
    4      0.1269922584            2.509329792
    5      0.09016445494           2.496320715
    6      0.06592816257           2.488147649
    7      0.04929431046           2.481940613
    8      0.03751871185           2.476544438
    9      0.02897962203           2.471583746
    10     0.02266585817           2.466982861
    11     0.01792085923           2.462750421
    12     0.01430468951           2.458897371
    20     0.003008686707          2.438565967
    30     0.0006211064464         2.425096683
    40     0.0001622371309         2.416702495
    50     0.00004937221843        2.410831724
    100    0.0000004445933003      2.395743861



Here $m^*_L = \frac{m_L}{L^{1/4} C^{\sqrt{L}}}$. The main result of the paper implies that the sequence $m^*_L$ tends to
the limit $c_0 \sqrt[4]{4\pi^2\log 2} = 2.3562298899+$. Unfortunately, we do not yet have any evidence
that a closed form expression exists for $c_0$ (or for $m_L$ with $L \ge 2$). Finally,
we remark that the result of Theorem 1 should be considered in conjunction with the linear
relations [1] $m_L = \sum_{s=0}^{L} \binom{L}{s} (-1)^s m_s$, $L \ge 0$, which the sequence $m_L$ satisfies.
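The tabulated values can be cross-checked against both the normalisation $m^*_L$ and the linear relations just mentioned. The sketch below copies a few values of $m_L$ from the table ($m_0 = 1$ follows from (2)); the dictionary `m` and the function names are hypothetical helpers.

```python
import math

C = math.exp(-2.0 * math.sqrt(math.log(2.0)))
# Limit of m*_L according to Theorem 1, using the quoted value of c_0.
LIMIT = (4.0 * math.pi ** 2 * math.log(2.0)) ** 0.25 * 1.030199563382

# Values of m_L copied from the table above.
m = {0: 1.0, 1: 0.5, 2: 0.2909264764, 3: 0.1863897146, 4: 0.1269922584,
     5: 0.09016445494, 6: 0.06592816257, 7: 0.04929431046,
     20: 0.003008686707, 50: 0.00004937221843, 100: 0.0000004445933003}

def normalized(L):
    """m*_L = m_L / (L^(1/4) C^sqrt(L))."""
    return m[L] / (L ** 0.25 * C ** math.sqrt(L))

def symmetry_residual(L):
    """sum_{s=0}^{L} binom(L, s) (-1)^s m_s - m_L, which should vanish."""
    return sum(math.comb(L, s) * (-1) ** s * m[s] for s in range(L + 1)) - m[L]

for L in (1, 20, 50, 100):
    print(L, normalized(L), LIMIT)
for L in range(1, 8):
    print(L, symmetry_residual(L))
```

The residuals are zero up to the rounding of the tabulated digits, while $m^*_L$ decreases slowly towards the limit, consistent with an error term of relative order $L^{-1/2}$.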